The Phase Diagram of Random Heteropolymers 
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We propose a new analytic approach to study the phase diagram of random heteropolymers, 
based on the cavity method. For copolymers we analyze the nature and phenomenology of the glass 
transition as a function of sequence correlations. Depending on these correlations, we find that 
two different scenarios for the glass transition can occur. We show that, beside the much studied 
possibility of an abrupt freezing transition at low temperature, the system can exhibit, upon cooling, 
a first transition to a soft glass phase with fully broken replica symmetry and a continuously growing 
degree of freezing as the temperature is lowered. 
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The statistical mechanics of random heteropolymers 
has attracted a lot of interest in the past two decades, 
mainly for its close relation to the long standing prob- 
lem of protein folding Q , but also because of its applica- 
tions to material sciences • It has soon been recognized 
that the energy landscape and low temperature physics 
of heteropolymers bear a strong resemblance with that 
of spin glasses, and many theoretical concepts have been 
fruitfully adapted to that particular class of disordered 
systems HQ. 

A popular guiding idea in this field was the conjec- 
ture that typical low energy configurations of a random 
heteropolymer behave as independent energy levels 
as it happens in the random energy model (REM) 
Mean field replica calculations corroborate such a dis- 
continuous glass transition with one step of replica sym- 
metry breaking, both for random bond interactions 1^ 
and for copolymers However, the applicability of the 
simple REM-like picture to heteropolymers and proteins 
has been questioned 0. The numerical stud y o f self- 
consistent dynamical equations for short chains [lOj hints 
at a more refined scenario which we shall compare with 
our own results below. 

The polymer freezing manifests itself in persisting cor- 
relations between two times ty^, and + t, after the 
molecule has been prepared in a random conforma- 
tion at time The following family of correla- 
tion functions for a chain of length N proves very use- 
ful [13: Ci{t,t^) = iV-i^^(Ar,,i(i) ' An,i(i„)) . Here 
Ari^i = Tij^i — Ti and is the spatial position of the i-th 
monomer of the chain. Within the glassy phase, these 
correlation functions should exhibit a plateau at qi = 

limt_,oo limf^^oo C'K^' ^ui) ^^.'^ are much smaller 
than the equilibration time Teq). 

The REM-like scenario predicts an abrupt freezing in a 
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metastable conformation, implying a discontinuous jump 
of the plateau height qi from zero to a finite value at the 
glass transition. However, many other disordered sys- 
tems, like spin glasses, exhibit a continuous glass transi- 
tion (in the "overlap" parameter, qi). This has rece ntly 
been predicted for the random amphiphilic chain |l2l |. 
as does our analysis for a certain class of correlated se- 
quences. The folding properties of a given heteropolymer 
will be crucially affected by the type of glass transition it 
undergoes, since the dynamics (e.g., the a ging behavior) 
are rather different in these two scenarios 

Apart from numerical simulations, most of our fun- 
damental knowledge about heteropolymers relies on two 
main analytical methods: the replica a^pproach, with 
some kind of Gaussian variational Ansatz |3 , and the use 
of dynamical e quat ions, with some self-consistent closure 
approximation [lO|, [iSj . Considering that these methods 
use some rather severe approximations which are not re- 
ally independent from each other, it is clearly useful to 
have a new analytic method emerging. We propose here 
such a new approach, based on the cavity method )l4l |. 
which is an extension of the classical Bethe approxima- 
tion to frustrated systems. It is the simplest mean-field 
approximation that handles exactly the short distance 
correlations and in this respect it is very different 
from the usual replica approach. Moreover, as we will see 
below, the existence of a non-trivial statistical mechanics 
model on a 'Bethe lattice' for which the method is exact 
is very helpful. We expect the cavity approximation to be 
faithful as long as crystalline, or more complex, spatially 
ordered structures are suppressed by the randomness of 
the chain sequence. 

We show that for fixed monomer-monomer interac- 
tions, both types of glass transition can be found. De- 
pending on the correlations in the chain sequence, one 
finds either a transition to a 'frozen glass' phase, with 
a jump in qi (we shall call Td — l//3d the correspond- 
ing transition temperature), or a continuous transition 
to a 'soft glass' phase, where qi develops continuously 
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from zero (we denote by Pi the inverse critical tempera- 
ture in this case), see Figs. ^ and EJ The predicted soft 
glass phase bears some resemblance with the intermedi- 
ate glass phase found in the dynamic approaches [lo| . 

Consider a lattice polymer whose conformation is given 
by a self-avoiding walk. Two monomers interact with 
an energy e„^„i if they occupy two neighboring sites i 
and j. Here a, a' G {1,-1} <-*■ {A^B} denote the 
monomer types. We will restrict to the symmetric case 
eAA = G-BB = -~&AB and only consider neutral polymers 
(equal number of A's and S's). The case caa = — 1 is 
close to the popular HP-model for proteins where hy- 
drophobic and polar constituents tend to cluster, while 
eAA = 1 corresponds to charged polymers (ampholytes) 
with short-range interactions. 

On a lattice with V sites, we consider the polymer in 
the grand canonical ensemble with a chemical potential 
/i per monomer. One can identify two phases depending 
on the scaling of the average length {N) of the polymer 
[l5| . For small /i (/i < ^ic{T)) there exists an infinitely 
diluted phase where {N)/V as y ^ cx3. For /i > n^, 
there is a dense phase with a finite (N) /V. To describe 
a polymer in equilibrium with the solvent the chemical 
potential has to be adjusted to the critical value jic- The 
0-point where the polymer collapses to the liquid globule 
occurs as a tricritical point on this line [3] , see Fig. ^ 

Before explaining the cavity approach, we notice that 
its approximations become exact on a 'Bethe lattice' 
whose appropriate definition for a frustrated system is 
a random graph with N vertices and fixed connectivity 
(taken equal to /c -I- 1 = 6 here to mimic cubic lattices) 
|14j . Since the size of typical loops in such a graph di- 
verges as 0(log(A'')), the graph is locally tree-hke, allow- 
ing for iterative solutions of the statistical physics prob- 
lem. 

For technical reasons we work with a periodic sequence 
of monomers with period L and take the limit L — > oo 
in the end. Let us now consider an oriented edge of the 
graph, going from a 'root' site R to another site S. It can 
be in one of the following states: (0): neither root nor 
edge are occupied; {+,a) or (— ,a): the root is occupied 
by monomer a G [1,^] and the backbone continues on 
the edge, the sign ± indicating that S is occupied by 
monomer a ± 1; (2a): the root is occupied by monomer 
a, but the edge is free. We now exploit the local tree-like 
structure to recursively express the (Gibbs) probability 
Pa ^ of finding the oriented edge '0' in the conformation 
a G {0, (-f,a), (— ,a),2a} in terms of the corresponding 
probabilities p'"^} on the remaining edges (i = 1, . . . , /c) 
that are connected to the root of the edge '0': 

i=l a' — l 
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FIG. 1: The phase diagram in the grand canonical ensem- 
ble. An unconstrained polymer lives on the coexistence line 
(fi — /ic(r)). In correlated HP-polymers and anticorrelated 
ampholytes (top) a continuous glass transition occurs at the 
liquid instability. It is followed by a discontinuous freezing 
transition. Oppositely correlated sequences (bottom) exhibit 
a direct freezing transition from the liquid globule. 

= t^-^e^-^p« ^,n^«, (2) 

i=l j^i 

where C is a normalization and we have introduced 

i^i'^=pi'^+j:^=A'y'-'-- 

The recursion is exact on a tree, but it also holds 
asymptotically on a large N Bethe lattice, in the liquid 
phase or inside a pure state where the probabilities on 
the k parent links are uncorrelated. 

The liquid phase is described by a fixed point pi' ■* = p* 
of Eqs. The liquid free energy cjinq is obtained as 

a sum of site and edge contributions which are expressed 
in terms of {p^} following the general prescription of 
Ref. Q. It turns out that the liquid globule of neu- 
tral copolymers bears no trace of disorder since (jjuq is 
independent of the sequence. 

For any heterogeneous sequence the entropy of the liq- 
uid solution turns negative at sufficiently low temper- 
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atures, indicating the existence of a glass phase which 
breaks the translational invariance. The latter is also 
clearly observed in Monte Carlo simulations both on the 
Bethe and the cubic 3d lattice 0|. To treat the glass 
phase, the above framework has to be extended. We 
assume that below the glass transition the phase space 
splits into a large number of pure states. Restricting 
the Gibbs measure to a state 7 gives rise to probabilities 
P'a'^\ and we again obtain a closed recursion relation if 
we look at the distribution of local probabilities over all 
pure states Q, /o(p) = A^-^ w^5{p - p(*'T)). It will 
not depend on the site i since the graph is regular. The 
cavity recursion then reads 

'^(p)=^ /n pip^'^w^ <5(p-p(o)[pw,...,p«]) 



-m/3A/[p< 



(4) 



where p*'^' [p*^^-* , . . . , p'^'^-'] is the pure state recursion (|1I3|I 



W,...,pW]., 

We have introduced in Q a reweighting with respect 
to the free energy change A/ corresponding to the ad- 
dition of the site 0, in order to take into account the 
exponential increase of the number of pure states as a 
function of the total free energy. A/ is related to the 
normalization factor in pi3|l by exp(— /3A/) = C. The 
reweighting depends on m € [0, 1] which is the cavity ana- 
log of the breakpoint parameter appearing in Parisi's one- 
step replica symmetry breaking scheme. As explained in 
[ij, [l?! , one can write down an m-dependent free energy 
(j)i{m), and the dominant metastable states correspond 
to m* which maximizes 4>i(m). 

The local stability of the liquid phase with respect to 
the glass transition can be studied by computing appro- 
priately defined glass susceptibilities. The simplest one 
can be written by associating to each site i of the lattice 
a 'spin' Si. We define Si = if i is empty and s,; — a, if i 
is occupied by a monomer of type a. The susceptibility 
is given by Xg = V~^Y.ij (siSj)'^- A divergence of Xg 
signals the appearance of a glass phase. 

The instability of the liquid can be analyzed for arbi- 
trary sequences of monomers. It occurs at the tempera- 
ture Pi determined by 

1 + Z2{(3,) cosh(A) ^ _^i/2 , 2(fc - l)Qseq 



Z2{P 

where Z: 
quence 



sinh(/3j) 
) 



fcl/2(l_fc-i/2)' 

is independent of the 
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Lp2/p^ IS independent ot the se- 
The latter only enters through the term 

Qsoq = J2b=o ^b{±y/k)~'' where qt = i/LJ2a=i'^a'^<^+b 
is the sequence autocorrelation function and the up- 
per/lower sign corresponds to the HP-model and the am- 
pholyte, respectively. Equation lO can be regarded as 
the generalization of a classic result by de Gennes IS] to 
strongly frustrated systems. 

In Fig. 12 we plot the local instability /3i obtained from 
Eq. |SJ) as a function of sequence correlations in the sim- 
ple case of Markovian chains described by the probability 
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FIG. 2: The phase diagram as a function of sequence correla- 
tions TT for HP polymers. At high temperature (small /3) the 
polymer is found in a liquid globule phase. For correlated se- 
quences (tt > 0.5), the molecule undergoes a continuous phase 
transition at inverse temperature Pi given by 0, followed 
by a discontinuous freezing at Pd- Anticorrelated sequences 
(tt < 0.5) freeze discontinuously at /3d without intermediate 
phase. The inverse temperature of abrupt freezing, Pd, turns 
out to be nearly independent of the sequence, as indicated by 
the horizontal line at Pd ~ 1.23. The data points show the 
Pd's computed for a few particular sequences with L = 20. 



TT for two consecutive monomers to be equal. If the insta- 
bility occurs too close to the entropy crisis of the liquid, 
it will be prevented by a discontinuous transition at (3d 
where a non-trivial solution of the cavity equation Q 
appears. This effect can be studied by a numerical solu- 
tion of this equation using a population dynamics method 
[1^ IT^ . Upon increasing the period L, we found the 
transition points to approach a correlation-independent 
constant f3d ~ 1.23 ± 0.03 that we believe to be a good 
approximation in the large L limit, see Fig. |21 

For ampholytic sequences with a bias to alternation, 
TT < 0.5, and for HP-like polymers with a preference for 
blocks, TT > 0.5, there is a continuous glass transition at 
Pi < Pd as predicted by Q. It is followed by a discon- 
tinuous freezing transition at lower temperature Pd- In 
copolymers with opposite correlations, the liquid insta- 
bility is irrelevant and the freezing transition occurs di- 
rectly from the liquid globule phase. The grand-canonical 
phase diagrams for the two scenarios are presented in 
Fig- n for two fixed values of tt. 

At the continuous phase transition Pi, the distribu- 
tion p(p) continuously departs from the liquid fixed point 
<^(P ~ P*)- The transition is of third order in the free 
energy, and as usual, the presence of a continuous transi- 
tion indicates that the replica symmetry is fully broken. 
The states will organize in an ultrametric hierarchy of 
clusters with small 'preferences' towards some particu- 
lar conformations. On lowering the temperature, these 
preferences grow and result in increasing correlations at 
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FIG. 3: Temperature dependence of the correlation plateaux 
qi, for three types of correlations with I = 1,2,3, computed 
using the cavity method. The main graph shows the con- 
tinuous transition of strongly correlated HP sequences. The 
inset displays the discontinuous transition of anticorrelated 
HP sequences. 

larger distances. This effect might be related to the scale- 
dependent freezing predicted in 0, and the interme- 
diate glass phase found in the numerics of [loj . 

The frozen phase at low temperature is of rather dif- 
ferent character. The site probabilities are strongly 
biased towards one particular site-dependent conforma- 
tion. The pure states correspond to almost frozen con- 
formations and have very small internal entropy, as in 
the REM. 

The freezing phenomenon can be quantitatively de- 
scribed by the correlation plateaux qi defined above. In 
Fig. Owe present our analytical results for qi{T) for two 
sequences with opposite correlations. 

What are the modifications of the above mean field 
picture for realistic (finite-dimensional) models? At least 
two features must be seriously reconsidered. First, within 
our approximation, the formation of ordered structures 
is generally inhibited. Let us consider, for instance, the 
HP copolymer with highly correlated sequences tt « 1. 
In dimension £), at low temperature, the system can 
separate into A-rich and B-rich regions of linear size 
I ^ log(l — tt)^/^, with a frustration energy of order 
per monomer. The phase diagram in Fig. [5] must there- 
fore be modified in the tt « 1 (for the HP model) and 
TT « (for the ampholyte) regions due to the emergence 
of spatially ordered phases. A proper treatment of these 
phenomena (which are crucial for block copolymers 0) 
is beyond the scope of this paper. 

Second, the discontinuous dynamical phase transition 
to the frozen glass phase cannot survive in finite di- 
mensions because of nucleation phenomena. Accord- 
ing to the usual scenario of discontinuous glass transi- 
tions, we expect it to become a sharp crossover associated 



with the emergence of very large ('activated') relaxation 
times \n\. 

In conclusion, we have described a new approach to 
heteropolymeric systems. The mean-field approximation 
is introduced in a well-defined and consistent way, and 
any further calculation can be checked through simula- 
tions on the Bethe lattice. The flexibility of the approach 
should allow for many applications (e.g., designed se- 
quences) and further refinements (e.g., clusters of sites 
in the cavity). We considered the case of copolymers 
with correlated sequences and found that the frozen glass 
phase can be preceded by a soft glass phase with very 
different dynamical properties. Since the sequence corre- 
lations turned out to be the decisive parameter, it would 
be interestin g to analyze protein sequences in the light of 
our findings [l^ . 
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